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We consider general settings of Bell inequality experiments with many parties, where each party chooses from 
a finite number of measurement settings each with a finite number of outcomes. We investigate the constraints 
that Bell inequalities place upon the correlations possible in a local hidden variable theories using a geometrical 
picture of correlations. We show that local hidden variable theories can be characterized in terms of limited 
computational expressiveness, which allows us to characterize families of Bell inequalities. The limited com- 
putational expressiveness for many settings (each with many outcomes) generalizes previous results about the 
many-party situation each with a choice of two possible measurements (each with two outcomes). Using this 
computational picture we present generalizations of the Popescu-Rohrlich non-local box for many parties and 
non-binary inputs and outputs at each site. Finally, we comment on the effect of pre-processing on measure- 
ment data in our generalized setting and show that it becomes problematic outside of the binary setting, in 
that it allows local hidden variable theories to simulate maximally non-local correlations such as those of these 
generalised Popescu-Rohrlich non-local boxes. 



I. INTRODUCTION 

Quantum mechanics is incompatible with a classical view of 
the world in many ways. In particular, quantum theory is in- 
compatible with the assumption of local realism as there are 
entangled quantum states and measurements that lead to viola- 
tions of a Bell inequality [ 1 1 . Consequently, Bell inequalities 
provide an extremely clear distinction between classical, lo- 
cal hidden variable (LHV), and non-classical (e.g., quantum) 
theories and so have been studied with great fervor since their 
discovery. 

Another motivation for the study of Bell inequalities has 
been from an informational point-of-view. Quantum informa- 
tion has brought new insight into the very nature of quantum 
mechanics and Bell inequalities have been used to gain insight 
into the information processing power of quantum mechanics. 
Bell inequality violations have been used to guarantee the se- 
curity of quantum key distribution [2 |, generate randomness 
[3 1, give an advantage in communication complexity [4] and 
non-local games [5 |. Recently, connections have been made 
between quantum computing and a violation of a Bell inequal- 
ity as giving some computational advantage J6HH1. 

Despite their utility, there are many open questions about 
Bell inequalities. Many breakthroughs have been made by 
constructing Bell inequalities for many parties each with a 
choice of two measurements and two measurement outcomes 
ll9 fflD . However, constructing Bell inequalities (that fully de- 
fine LHV correlations) in general is NP-hard in the number of 
measurement choices and measurement outcomes at each site 

ma. 

Generalizations of the Bell inequality experiment away 
from the two measurement setting, two measurement outcome 
scenario have been studied and have led to the discovery of 
interesting phenomena [ 13-18 1. For example, quantum vio- 
lations can be greater |[T4ll and more robust to experimental 
imperfections |fl9l in these general scenarios. 

The Clauser-Horne (CH) inequality GDI also showed that 
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FIG. 1: In a single run of a general Bell experiment, n parties each 
make a measurement from c possible choices, where each measure- 
ment has d possible outcomes. Labelling the jth parties' measure- 
ment choice and outcome by Sj and rrij respectively, we can describe 
each run of the experiment with n-digit strings m and s. 



the full probability distribution (including marginal probabili- 
ties) can be constrained in LHV theories and violated by quan- 
tum mechanics. However, quantum theory does not obtain the 
maximal violation of the CH inequality possible in theories 
that do not allow superluminal signaling (non-signaling for 
short), as demonstrated by Popescu-Rohrlich boxes ED . As 
a result, much work has gone into characterizing the space 
of non-signaling probability distributions El . An approach 
based around convex polytopes has been particularly fruit- 
ful but the space of all non-signaling probability distributions 
remains complicated and often counter-intuitive, as demon- 
strated by the Guess Your Neighbor's Input non-local game 
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The convex polytope of non-signaling probability distribu- 
tions in the CH inequality setting is well-studied and is an 
excellent platform for describing quantum correlations via an 
information theoretic, or physical principle [24 -27 1. For more 
general settings, it is difficult to describe the space of non- 
signaling probabilities. To avoid this difficulty yet still obtain 
some intuition about the space of non-signaling probabilities, 
we consider the space of correlators, i.e., the space of proba- 
bility distributions over joint outcomes. 

For two-setting, two-outcome Bell inequalities, the correla- 
tions studied in Bell experiments can be described in terms 
of stochastic Boolean maps, e.g. probabilistic mixtures of 
Boolean functions from the two bits describing the measure- 
ment settings to the parity of the measurement outcomes, 
which is a single bit. This approach has proven well-suited to 
the study of many-body generalizations of the Clauser-Horne- 
Shimony-Holt (CHSH) inequality and led to a number of uni- 
fications and new insights (6|-[8). 

In this paper, we generalize this approach by replacing 
stochastic Boolean maps with stochastic maps from the set 
of measurement settings to a single digit representing the par- 
ity of the output digits. Our generalization reveals new phe- 
nomena which do not occur in the simpler two-setting, two- 
outcome case. 

Hand-in-hand with this "computational" description of cor- 
relations we shall use a geometric approach. The list of con- 
ditional probabilities describing the stochastic map can also 
be treated as a vector in a real vector space [12, 28, 29) . In 
this picture, the valid sets of correlations which occur in a the- 
ory form a convex region in this space, which, in the case of 
LHV theories, is a polytope. A polytope can be defined as the 
intersection of the half-spaces that satisfy a set of linear in- 
equalities, which, for the LHV polytope, are simply the facet 
Bell inequalities. 

Very recently, two of us showed that for 2-setting 2- 
outcome Bell inequalities, computational expressiveness pro- 
vides an elegant method for studying Measurement-based 
Quantum Computing Q E). We shall show how such ap- 
proaches can be generalized, and that the switch from Boolean 
to more general maps allows more complex behavior than in 
the binary case. 

In Section|n] we outline the general framework, fix notation 
and introduce the modular arithmetic which will be employed 



throughout the paper. In Section III we construct Bell inequal- 
ities in this framework. In Section |IV] we study how methods 
from [3 and flU can be generalized and then summarize in 
Section [V] 



II. FRAMEWORK FOR GENERAL CHSH TESTS 

The general scenario we consider is the n-partite Bell exper- 
iment shown in Fig. [T] where n > 1. For simplicity, we 
assume all space-like separated parties choose from c possi- 
ble measurement settings, each with d possible outcomes, so 
we label an experiment by the variables (n, c, d). 

As in the standard CHSH experiment, we assume that each 
party's measurement setting is chosen randomly from a uni- 



form distribution and is uncorrected with the state of the sys- 
tem (see [30 - 33 1 for some consequences of dropping this as- 
sumption). 

We label the c possible measurements and d possible mea- 



surement outcomes by digits Sj 6 {0, 



— 1} € Z c and 



rrij 6 Zd respectively. Therefore the n settings and measure- 
ments for each repetition of the experiment are labeled by n 
digit-strings m e and s € Z" corresponding to all n mea- 
surement outcomes and all n settings respectively, where x 
denotes the n-digit string such that the j-th digit is Xj. 

To simplify the analysis we shall assume initially that both 
c and d are prime. We discuss features of the analysis in the 
non-prime case in appendix [B] We will employ arithmetic in 
both modulo c and modulo d. As we are considering corre- 
lators obtained by adding measurement outcomes, the major- 
ity of the arithmetic will be modulo d. Therefore we use the 
following convention. In all expressions, arithmetic will be 
modulo d, except in exceptional cases where the arithmetic is 
modulo c, which will be denoted with large square brackets 
and subscript c, i.e. [• • • ] c . 

After the data m and s has been collected from many repeti- 
tions of the experiments, conditional probabilities p(m|s) for 
each value of m and s can be calculated. In the initial CHSH 
paper, and in many-body generalizations, rather than study- 
ing this full probability distribution, one merely considers the 
statistics of the parity of the output m. Here, we shall take a 
similar course and study correlators 



p(k\s) =p{^m.j = fc|s), 



(1) 



where p(E"=i "*j = fc|s) = £ m | E » =i ro . =fc p(m|s). When 
taking a sum over rrij we are performing addition modulo 
d but when taking sums of probabilities or describing any- 
thing that is not related to measurement settings or outcomes 
we are performing standard, natural arithmetic over the re- 
als. This natural generalization of the CHSH-type correla- 
tors in ([T| also encompasses the well-known Collins-Gisin- 
Linden-Masser-Popescu (CGLMP) inequality for two parties 
lfl4ll . These correlators have a well-defined role when con- 
sidering the expectation values of joint measurements (35). 
Moreover, singling out this one parameter simplifies the prob- 
lem by reducing the dimensionality of the problem, whilst still 
capturing interesting properties of the full distribution p(m|s). 

In Section Hi Al we introduce the mathematical framework 
for describing these correlators, which employs a correspon- 
dence between conditional probabilities and stochastic maps. 



In Section II B we apply this framework and derive the full 



family of correlators that are possible in LHV theories. Then, 



in Section II C we consider correlators in more general non- 
signaling theories and derive generalizations of the Popescu- 
Rohrlich non-local box ETl . 



A. Correlators as Stochastic Maps 

Before we begin our study of correlators, we need to introduce 
some notation and mathematical construction. The correlators 
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p(k\s) are maps from n digit-strings s £ Z" to probability 
distributions over a single digit k £ Z^. Any stochastic map 
is a convex combination of deterministic processes, which in 
this case are functions / : Z™ — > Z^. 

Such functions take an "input" s <E Z™ to a single "output" 
G Z(j. Since Z^ and Z™ are fields for c, d prime, these func- 
tions can be represented as elements of a vector space. There 
are many bases one could choose to represent the function. 
Perhaps the simplest and cleanest basis is the set of Kronecker 
delta functions 8\, where y £ Z™, which equals 1 when y = s 
and otherwise. Any function can be represented as 



/w = E fW 



(2) 



The d c " possible coefficients correspond to <i c " functions. It 
will sometimes be convenient for the basis set to include the 
constant function. We shall then replace the delta function 6° 
where C is the all-zeros string with the constant function to 
write 



vye(z ? -o c ) 



a, 



(3) 



where e y = (/(y) 



a) and a £ Z^. 
The natural expression for the computational power of cor- 
relators in a general theory is how close they can approximate 
an arbitrary function. Therefore we want to be able to rewrite 
Q as a polynomial. This can be done as the Kronecker delta 
8* can be written as a polynomial (modulo c), 



S y = IK II 1 - 

J = l 3=1 



n 



(c-l) 



i=0 



i e< i<r,')^w 



(4) 



The first line follows from Fermat's little theorem, as a 6-1 = 
1 mod b if a is non-zero and b is prime. To reach the sec- 
ond line we have used the binomial theorem which follows 
from the distributivity of modular arithmetic. This shows us 
that any function /(s) can be expressed as a polynomial with 
mixed modular arithmetic systems. Furthermore, when d > c 
we can write the function solely in terms of modulo d arith- 
metic since equation Q holds for any prime > c. 

In this paper, the following class of functions, which we 
term the "n-partite linear functions", will be crucial in the 
study of LHV theories. 

Definition 1. A function /(s) : Z" — > Z^ is an n-partite 
linear function if it can be expressed as 



(5) 



where addition is modulo d and the gj(sj) are functions 
Z c — > Zrf of a single variable. If /(s) is not an n-partite linear 
function, we refer to it as a non-n-partite linear function. 



These functions will play a role analogous to that of the lin- 
ear Boolean functions in [7, 8 |. The n-partite linear functions 
can also be written as 



/(») = Oi + 



n (c-l) 

E E ft.-*? 



(6) 



with a, f3j. a £ 7L c i- There are d n ^ c ~ 1 ' +1 n-partite linear func- 
tions. For c being prime, the key feature of a non-n-partite 
linear function is the presence of cross-multiplicative terms in 
Q between different digits Sj of s. 

For non-prime c, we cannot write delta functions in the 
neat polynomial of Q but in a more elaborate fashion (see 
appendix [Bj but we can still work in terms of the delta func- 
tions for non-prime c. Therefore, the definition of an ?i-partite 
linear function in |6]) applies for non-prime c. Thus for prime 
and non-prime c, the delta functions Sa 3 only singularly define 
a value of /(s) when there is only one non-zero element in s. 
For other values of s there is addition modulo d between the 
terms Si 3 . Equivalently then, for /(s) to be a non-n-partite 
linear function, there must be at least one delta function S* in 
[3]with y being a digit-string with more than one non-zero ele- 
ment. A function f(s) can be written in terms of an n-partite 
linear part plus a non-n-partite linear part (with addition mod- 
ulo d), and if the n-partite linear part is zero, then f(s) = 
for all s with only one non-zero element. 

Finally, we note that the above treatment also applies in 
a more general setting where instead of setting digit-strings 
s £ Z™ we now have digit-strings s £ Z Cl x Z C2 x ... x Z Crl = 
(Bj=i ■ That is, the number of possible settings, now la- 
beled Cj at each jth site could now be different from other 
sites and also non-prime. The string s is now a direct sum (or 
Cartesian product) of the registers Z Cj for each site. All delta 
functions as defined can be carried over to this general set- 
ting. Thus single site maps and n-partite linear functions are 
exactly the same as |6| but for the jth site, c is replaced with 
Cj. We now discuss a geometrical picture of the correlators 
p(k\s). 

The correlators p(k\s) are stochastic maps and so must sat- 
isfy the positivity inequalities, i.e., the set of c n (d — 1) linear 
inequalities p(k\s) > 0, for all k, s. Furthermore, as some out- 
come always occurs for any choice of measurement settings s, 
the correlators must also satisfy the normalization equations, 
S/cez d P(k\ s ) = 1 f° r a ^ s - W e then only have (d — 1) inde- 
pendent correlators for each value of s, so we omit the corre- 
lator p(0|s) and treat the remaining correlators as elements of 
a vector in a c n (d — 1) -dimensional real space with elements 
being p(k\s) for k ^ 0. 

The space of correlators satisfying the linear inequalities of 
the positivity conditions p(k\s) > and normalisation condi- 
tions in this reduced space, J2k^oP(^\ s ) — 1> ^ s tnen a con " 
vex poly tope labeled V. It is a convex poly tope as it is the 
intersection of the half-spaces defined by these linear inequal- 
ities. In any specific theory, the region of allowed correla- 
tors, T, will be a convex subregion of V. Equivalently, V can 
be defined as the convex hull of all deterministic correlators 
p(k\s) £ {0, 1}, for all k and s; these deterministic correlators 
are the extreme points of V. 
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If T has a finite number of extreme points (which is true 
for LHV theories but not for quantum theory), then T will be 
a convex polytope and so can be described as the intersection 
of the half-spaces defined by a set of "facet-defining" linear 
inequalities. Facet-defining inequalities are defined in terms 
of affinely independent points, where a set S of K vectors pi, 
S = {po,Pi, ■■■tP(k-i)} is affinely independent if for every 
Pk € S, the (K — 1) vectors in the set {pi — Pk\Pi 7^ Pk} are 
linearly independent [34|. 

Definition 2. A linear inequality is facet-defining for a con- 
vex polytope in M. D when at least D affinely independent ex- 
treme points saturate the inequality (i.e. satisfy the equality of 
the linear inequality). 

Having constructed a framework for stochastic maps (and 
equivalently, conditional probabilities) we can immediately 
apply this to the correlators studied in this paper. Recall that 
we will focus our attention on correlators p(k\s) represent- 
ing the probability distribution on k = Y^j=i m j' me sum 
modulo d of the outputs. The correlators have the form of a 
conditional probability and thus a stochastic map, and there- 
fore are characterized by vectors in the space V n ^ c ^d- The 
space V n ,c.d represents the set of all correlators possible in 
principle, but for certain theories, not all correlators will be 
permitted. The focus of our study is then the regions of cor- 
relators which are accessible in LHV theories and in quantum 
theories. The region of LHV correlators is itself a polytope, 
and the facet-defining inequalities represent the facet-defining 
Bell inequalities |fT0Hl2l |3^1 . This is the geometrical picture 
of Bell inequalities that we will consider in the next subsec- 
tion. We now study and characterize the region of correlators 
in LHV theories. 



B. Local Hidden Variable Theories 

LHV theories are theories in which the measurement out- 
comes at each site can depend on the input setting of the mea- 
surement. The only other parameter the outcome depends on 
is an objective "hidden variable" A € A which we assume is 
shared by all parties, and where A is the (often continuous) 
set which defines the possible values for this variable. We 
do not assume that A is deterministic, but allow it to satisfy 
a probability distribution p(X)d\ over the space A such that 
J A p(X)dX = 1 . Without loss of generality [36], we assume 
that apart from the probability distribution over A the internal 
workings of each measurement device is deterministic. This, 
via a standard argument, results in conditional probabilities 
for the set of outputs m of the following form, 



P (m\s) = / dXp(X) J] 



p(m,-|sj-,A). 



(7) 



Since A is the only non-deterministic element to consider, we 
can study the LHV correlations as the convex combination of 
the set of possible deterministic maps. We now prove that 
the map from measurement settings to correlators in any LHV 
theory is a probabilistic combination of n-partite linear func- 
tions. 
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FIG. 2: Schematic for correlators in LHV theories versus more 
general theories (including quantum mechanical correlators). The 
shapes represent the convex polytopes of general correlators and 
LHV correlators, i.e. the space C The points on each shape are the 
deterministic maps corresponding to the particular functions listed. 
We have shown the region of quantum correlators lying between the 
convex polytope of n-partite linear functions and the bigger con- 
vex polytope of all possible (including non-n-partite linear) maps. 
The facet Bell inequalities define the region C of LHV correlators 
whereas the non-trivial Bell inequalities (as mentioned in section|ni} 
just bound the region. It is preferable but very hard [ 12] to find facet 
Bell inequalities rather than just non-trivial ones. 



Theorem 1. In an n-party generalized Bell experiment, with 
c settings and d outputs for each measurement, the region C 
of correlators accessible in a LHV theory is the convex hull of 
the set of n-partite linear functions. 

Proof. The space C of correlators is a convex polytope, we 
only need to find the extreme points of the polytope, or de- 
terministic correlators, and take their convex hull l36l . This 
means we just need to find the deterministic functions possi- 
ble with LHV correlators. These deterministic functions cor- 
respond to adding the measurement outcomes of space-like 
separated sites, which, for a LHV theory, is deterministic if 
and only if the maps from the measurement setting Sj to the 
measurement outcome rrij at each site is deterministic, i.e., of 
the form g-j{sj) in Definition [T] for all j. Adding such func- 
tions modulo d gives a n-partite linear function. □ 

This theorem generalizes the theorem valid for the (n,2,2) 
case presented by two of us in [8 1. It shows that C is com- 
pactly characterized by considering it in terms of the com- 
putational expressiveness of LHV correlators (see Fig. |2]for a 
schematic of the consequences of Theorem 1). In other words, 
if one considers the Bell experiment as a computation, LHV 
theories allow the computation of n-partite linear functions 
and nothing more. From this perspective, LHV theories have 
limited computational power, and thus a theory that allows 
correlators outside this region can be perceived as having a 
computational advantage in a Bell experiment. 

As mentioned above, there is a dual description of a convex 
polytope in terms of linear inequalities. The convex polytope 
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of LHV correlators L is the intersection of the half-spaces 
described by the set of facet-defining Bell inequalities (or 
facet Bell inequalities for short). In Fig. [2] we illustrate the 
different types of Bell inequalities which one can construct, 
and the terminology to classify them. Bell inequalities that 
are equivalent to normalization or positivity conditions (i.e 
the boundaries of V) are referred to as trivial Bell inequal- 
ities. There remain Bell inequalities that are neither neces- 
sarily facet-defining or trivial, which we call non-trivial Bell 
inequalities; they are non-trivial because they indicate a sepa- 
ration between the polytopes V and C by bounding the latter. 

If a correlator is outside C then it must violate one of the 
facet Bell inequalities. For example, it is well-known that 
some quantum correlators violate a Bell inequality [1J. Also 
if we observe a violation of a non-trivial inequality then we 
know that it is outside of C. To recapitulate, for either non- 
trivial or facet Bell inequalities, we associate a violation with 
a computational advantage and in Section|lII]we consider Bell 
inequalities from this computational perspective. 

Before investigating Bell inequalities, it is worth consider- 
ing the relation between p(k\s) and p(m|s). Since the latter 
is more general than the former, we are excluding interest- 
ing phenomena by only considering correlators. Examples 
of interesting phenomena in the full p(m|s) setting include 
the 13322 inequality [18| which has interesting implications 
with regards to quantum correlations; another example is the 
Guess Your Neighbor's Input [23 1 non-local game which re- 
sults in some facet Bell inequalities that are not violated by 
any quantum correlators. In spite of these examples, we ar- 
gue in the next subsection that the correlators p(k\s) capture 
many important phenomena of full non-signaling probability 
distributions. In particular, we show that particular determin- 
istic correlators can be uniquely associated with a single non- 
signaling distribution, that being a many-body generalization 
of the PR non-local box ETI . 



C. Non-signaling theories 

In this section, we shall consider general non-signaling theo- 
ries. These are theories in which there are no constraints on 
correlations other than the no-signaling conditions. First, note 
that given any deterministic correlator p(k = /(s)|s) where 
k = J^j m j' there exists a non-signaling distribution such 
that p(rrij\sj) = A and the non-signaling distribution results 
in the correlator. This is an immediate consequence of Theo- 
rem [2] proven below, but can be understood intuitively since 
every possible value of rrij is consistent with one of more out- 
put strings m where Y2j m j = /( s )- One can then take a 
suitable mixture of these strings such that every output digit 
is maximally uncertain. Since a maximally uncertain output 
carries no information about s, the no-signaling condition is 
automatically satisfied. 

We denote the space of probability distributions p(m|s) that 
satisfy the no-signaling conditions by MS n _ c .d (the indices 
will normally be omitted for clarity). This space is a convex 
polytope as it is the intersection of the half-spaces defined by 
the linear inequalities being the normalization, positivity and 



non-signaling inequalities (see [22] for details). Finding the 
vertices of this polytope is the dual problem of finding the 
facet-defining linear inequalities corresponding to some ver- 
tices (e.g. finding the facet Bell inequalities) which is an NP- 
hard problem. We now discuss some of the vertices of MS 
without resorting to this vertex enumeration problem in the 
following discussion. 

A famous example of a non-signaling distribution which 
cannot be simulated by LHV or quantum correlators for two 
spatially separated systems is the PR non-local box. The PR 
box is defined as a black box that results in measurement out- 
comes according to the probability distribution, 

t | \ )\ if mi + m 2 = sis 2 , 

p{mi,m 2 \si,s 2 ) = < * . (8) 

I otherwise. 

The PR box is described by the correlator p(k — sis 2 |s) = 1. 
In other words, the PR box is associated with a correlator in 
Vz, 2. 2- This correlator is a vertex of 7^2.2,2 corresponding to 
the deterministic property that the parity of outputs mi and 
rn 2 is equal to the product of the inputs Si and s 2 . Further- 
more, the PR box is the only distribution in A/\S 2 ,2,2 which is 
compatible with the correlator p(k = s\S2\s) = 1. In other 
words the vertex correlator p(k = SiS2|s) = 1 in V uniquely 
defines a full probability distribution in MS. 

In this section, we will show that this property is shared by 
many vertices in V. These vertices are only compatible with a 
single probability distribution in MS. In particular, for n = 2, 
every correlator in V corresponding to a non-rt-partite linear 
function is only compatible with one probability distribution 
in MS. 

We emphasize that the full set of vertices of MS for arbi- 
trary c, d and n has never been characterized, and its numer- 
ical computation is an NP hard problem (the vertex enumer- 
ation problem) ll22l . For example, in the (3, 2, 2) setting, the 
space NS has been shown to have a complicated structure (37). 
Therefore, we are able to capture a large amount of structure 
without resorting to optimization. 

Before we define this structure we need to introduce a new 
class of functions /(s) : Z™ — >• Z^ that we will only use in 
this subsection to find a subset of the vertices of MS. We 
call these functions "bi-partite linear" as they correspond to 
functions for which, if the observers gather into two groups 
at two space-like separated sites, the measurement outcome 
results from an n-partite linear function for n = 2. 

Definition 3. A bipartition, {A, B} of the set of n parties is 
a division of the set {1, 2, . . . , n} into two disjoint, non-empty 
sets A and B such that A U B = {1, 2, . . . , n}. 

Definition 4. A function /(s) : Z™ — » Z^ is bi-partite linear 

if for any bipartition {A, B}, /(s) can be written as 

f(s) = f A ( S A ) + f B ( S B ), (9) 

where s x is the | X | -digit string with entries s x = Sx j and 
f x (s x ) is a function on s x . 

We now show that there exists a unique non-signaling dis- 
tribution corresponding to a vertex p(k — /(s)|s) of V if and 
only if f(s) is not bi-partite linear. 
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Theorem 2. For every function f(s), the corresponding ver- 
tex p(k = f(s)\s) = 1 of V is compatible with the non- 
signaling probability distribution 



p(m\s) 



otherwise. 



m j 



/(*). 



(10) 



Furthermore, this is the only non-signaling distribution com- 
patible with p{k — f(s)\s) = 1 if and only if f(s) is not 
bi-partite linear. 

Proof. One can verify by inspection that this distribution is 
non-signaling (since all marginals are maximally uncertain) 
and satisfies p(k = /(s)|s) = 1. It remains to be shown 
that the distribution in (jTOf is unique if and only if /(s) is not 
bi-partite linear. To prove the if statement, note that for any 
bi-partite linear function /(s), the non-signaling distribution 



P(m|s) 



' d 2-\A\-\B\ =d 2-n if £ 

and zJjGB '"I 



/V) 

mi = / 2 (s B ) 



otherwise, 



(11) 

is compatible with p(k = /(s)|s) = 1. We prove the only if 
statement in Lemma[3] □ 

Note that local operations, which, for the Bell experiment 
setting, corresponds to adding functions g'j(sj) to the mea- 
surement outcomes at each site, do not change whether or not 
a function is bi-partite linear. Since every function can be 
written in terms of an n-partite linear part and a non-n-partite 
linear part, local operations can create the ?i-partite linear part 
and so without loss of generality this part can be set to be zero. 
Therefore it is sufficient to consider functions f(s) such that 
/(t) =0 for all n-digit strings t G Z™ that have at most one 
non-zero digit. 

We can separate the parties using any bipartition A and B 
and treat the sets of parties, A and B, as single parties. To do 
this, we take the sum modulo d of all outcomes on each side 
of the partition and set the number of settings on each side 
of the partition to be the product of the number of settings of 
the parties on that side of the partition (e.g., ca — YiieA c *)- 
For a function that is not bi-partite linear, for all bipartitions 
into A and B, the function / : Z CA x Z CB — > Id is non- 
ri-partite linear for n = 2. Therefore it is sufficient to show 
that Theorem|2]applies for n = 2 and so it must apply for all 
bipartitions of n parties as long as a function /(s) i n (fl0| ) is not 
bi-partite linear. If the probability distribution in (|10|> applies 
for n = 2 then it must apply for all possible bipartitions of n 
parties thus generating the distribution ( fTO) for all n. 

Lemma 3. For every function f(s\,s 2 ) '■ Z Cl x Z C2 — > TLd 
that is non-n partite linear for n = 2, the only non-signaling 
distribution compatible with the corresponding vertex p(k = 
/( s ij s 2)|*) — linV is 



p(mi,m 2 |si,s 2 ) 




if mi + m 2 = /(si,s 2 ), 
otherwise. 



Proof. The condition p(k = /(sx, S2)|si, s 2 ) = 1 for 
all s = {si,S2} implies that for every value of mi in 
p(rrii, m 2 |si, s 2 ), there exists a unique value of m 2 = /(s) — 
mi. In computer science terminology we would call this a 
"unique game". This immediately implies the equality for the 
following conditional distributions: 

p(n%i — x, m 2 = /(s) — x\s) = p{m\ = x\%) 

= p{m-2 = /(s) - x\s), 

(13) 

for all x € Z^. The no-signaling condition further implies that 

p(mi — x\s) = p(mi = x\si) and 



p(mi = x\si) = p(m 2 = f(s) - x\s 2 ), 



(14) 



which must be satisfied for all s and all x. We will show that 
repeated application of ( fl4] > for varying s allows us to prove 
that all non-marginal probabilities are equal provided that /(s) 
has a non-n-partite linear element. As discussed above, we 
can set /(0, s 2 ) = f(s\, 0) = for all s = {si, s 2 }. For such 
functions, repeatedly applying ( fl4| i gives 

p(m 2 = -x\s 2 ) = p(mi = x|0) 
= p(m 2 = —x\0) 
= p(mi = x\si) 

= p(m 2 = f (8i,8 2 ) - x\s 2 ) (15) 
for all x. Repeated iteration implies 

p(m 2 = -x\s 2 ) = p(m 2 = a/(si, s 2 ) - x\s 2 ) (16) 

for all a G Id- The function /(s) is non-n-partite lin- 
ear so it must have at least one value of {si,s 2 } where 
f(si,s 2 ) is non-zero. Since d is prime, af(si,s 2 ) takes 
on all values in Z^, therefore the marginals are p(m 2 \s 2 ) = 
d^ 1 for all m 2 ,s 2 . Applying equation ( fl4| ) implies that 
p(mi, m 2 |si, s 2 ) = d" 1 for all m such that mi + m 2 — 

m □ 

As we now show, for vertices p(k = /(s)|s) = 1 of V 
such that /(s) is bi-partite linear, the existence of multiple 
non-signaling distributions implies that the vertex of V corre- 
sponds to a facet of MS. 

Proposition 4. Every non-signaling distribution correspond- 
ing to a vertex ofV lies on a facet of MS. 

Proof. Any non-signaling distribution p(m|s) can be written 
as a convex combination p(m|s) = ^ B p(i?)p£;(m|s) of the 
vertices E of MS, where \^2 E p{E) = 1. Therefore the asso- 
ciated correlator is 



p(k\s) 



J2p(E)sf Umi p E (m\ S ) 

E 



= Y / p(E)p E (k\s), 



(12) where p E (k\s) = 6 



E 



(17) 



from each extreme point of MS. 



Ps(m|s) is the correlator resulting 
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As the non-signaling distribution results in a correlator that 
is a vertex of V, p(E) — for all vertices of MS that do not 
result in the same correlator. Denote by £ the set of vertices of 
MS that correspond to the correlator and let T be the region 
of convex combinations of the elements of £. Any element in 
F must correspond to the correlator by (17) . 

T is not a facet of MS if and only if there exists a con- 
vex combination of the vertices not in £ that intersects F. 
However, if there exists such a convex combination, then this 
would give a convex combination of correlators in V that is 
equal to a vertex of V, which is a contradiction. □ 

The uniqueness of the probability distributions in Theorem 
|2]implies that each vertex p(k — /(s)|s) = 1 of V such that 
/(s) is not bi-partite linear corresponds to a unique vertex of 
MS (i.e., the "facet" in Proposition [4] collapses to a single 
point). 



Proposition 5. Every probability distribution of the form ( |10) 
is a vertex of MS if and only if f(s) is not a bi-partite linear 
function. 

Proof. As there is a unique non-signaling distribution corre- 
sponding to the correlator p(/(s)|s) = 1, there is at most one 
E such that p{E) ^ in[l7] □ 

The structure of the high dimensional space of all possi- 
ble non-signaling distributions can be partly revealed by con- 
sidering the space of correlators, which are of relatively low 
dimension. Note that the difficulty in ascertaining the remain- 
ing structure of MS comes from the difficulty of finding the 
set of vertices that generate vertices of V that correspond to 
bi-partite linear functions and determining whether this gives 
a complete list of vertices. Furthermore, much of the struc- 
ture of LHV theories in MS is reduced to a small number of 
vertices of V. In this way, the space of correlators efficiently 
encapsulates vital information about all non-signaling proba- 
bility distributions. 

There is a connection between the notion of "true rt-party 
non-locality" [22 , 38 1 and the vertices of MS defined by The- 
orem [2] The bi-partite linear functions corresponding to ver- 
tices in V only reveal "non-locality" (i.e. correlations not re- 
sulting from LHV theories) between a subset of all parties, but 
no non-locality across at least one bipartition. However, func- 
tions that are not bi-partite linear result in correlations that are 
non-local across all partitions. 



D. Summary 

In this section we have constructed a framework for Bell cor- 
relators and showed that these correlators capture important 
properties of the space of non-signaling probability distribu- 
tions. These correlators also capture the properties of LHV 
theories in a natural information theoretic description, i.e. that 
of computational expressiveness. This work generalizes pre- 
vious work of [D and shows where the structure in these 
previously studied settings emerges from. 

Given this framework, we want to look at the correlators 
that are achievable with quantum theory. It is well-known 



n 


c 


d 


# Vertices 


# Facet Bell inequalities 


2 


2 


2 


8 


16 


2 


2 


3 


27 


66 


2 


2 


5 


125 


1020 


3 


2 


2 


16 


256 


3 


2 


3 


81 


125,412 


2 


3 


2 


32 


90 



TABLE I: A table of number of facet Bell inequalities for each setting 
(n, c, d) and the number of vertices for the LHV polytope. 



that there exists quantum mechanical correlators that cannot 
be described by correlators resulting from an LHV theory, and 
this is demonstrated by the violation of a Bell inequality iffl . 
Therefore we now devote our attention to the Bell inequalities. 



in. BELL INEQUALITIES 

Bell inequalities allow us to characterize the polytope of LHV 
correlators. The normalization and positivity inequalities de- 



fined in Section II B define part of C, the LHV polytope. How- 
ever, these inequalities are also satisfied by all possible cor- 
relators p(k\s) and define the polytope V of all correlators. 
These inequalities then trivially bound C as we cannot show 
a separation between V and C. We want to demonstrate such 
a separation and non-trivially bound V, which we can do by 
constructing Bell inequalities of the form 



^ w fc , s p(fc|s) < 7/;, 



(18) 



for all p(k\s) G C, where Wfc iS and jc are rea l numbers. We 
call yp the maximum value of the sum on the left-hand-side 
over correlators in V. For the Bell inequality to be non-trivial, 
we require jc < TP • 

If a correlator exists outside the LHV polytope, then it vi- 
olates at least one facet Bell inequality. Finding facet Bell 
inequalities involves an optimization of the parameters u>k,s 
in ( fTS) and in general is an NP-hard problem [ 12 1. The prob- 
lem is a facet enumeration problem and there are algorithmic 
packages that can perform this optimization such as Polymake 
11391 . These optimization algorithms are often used to charac- 
terize the non-signaling ll22l and LHV polytopes [18] in full 
probability distribution settings. We used Polymake to find 
the facet Bell inequalities for a number of small (n, c, d) set- 
tings. The number of facet Bell inequalities for each setting is 
listed in Tab. [I] which shows that the number of inequalities 
grows rapidly in the size of the problem. 

For the case of (n, 2, 2), the LHV polytope is a hyperocta- 
hedron ifTOlfTTI . In general though, there is no obvious geo- 
metrical structure or connection between the numbers of ver- 
tices and the numbers of facet inequalities. It is quite apparent 
that finding the facet Bell inequalities is no easy task. 

In this section we will use the computational insight gained 
in Section |TT| to construct Bell inequalities. We first find a 
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simple way to generate non-trivial Bell inequalities that nec- 
essarily bound the LHV polytope. As we shall discuss further 



£ S1 



;9(si,S2) 



S 2 



< 3 as the functions g(si, S2) overlaps with 



in Section IV these non-trivial inequalities also have a nice 



cross-over with the structures in so-called non-local games. 
We then describe some of the facet Bell inequalities and show 
that they too have a computational nature. 

Finally, in the last part of this section, we discuss the quan- 
tum violation of facet Bell inequalities. We show that there 
is a violation for all facet Bell inequalities for n = 2 in Tab. 
[I] We comment on the quantum states that maximally violate 
these Bell inequalities and find that in few instances is it a 
maximally entangled state. This supports the view that entan- 
glement and a violation of a Bell inequality are not synony- 
mous concepts. 



A. Non-trivial Bell inequalities 

One might ask what is the point of looking for Bell inequal- 
ities that are not facet-defining? Firstly, bounding the set of 
LHV correlators still creates a non-trivial subregion of a corre- 
lator space. Secondly, we can still demonstrate non-classical 
behavior without the computational difficulty that the facet- 
defining condition creates. Finally, despite not being facet- 
defining, non-trivial Bell inequalities can have a role in partic- 
ular applications such as in non-local games or Measurement- 
based Quantum Computing (MBQC) (8), as non-trivial Bell 
inequalities might capture computational power. We will elab- 
orate on this final point in Section |TV"| 

In this subsection, we will discuss non-trivial Bell inequali- 
ties as being inequalities in the space V' of all possible corre- 
lators p(fe|s), including those for k = 0. In this space, the 
normalisation conditions become J2kP(^\ s ) ~ 1 f° r a ^ s ' 
which defines a bounded hyperplane in R dc . The space of 
correlators V' is a convex polytope defined by the positivity 
conditions within this hyperplane. We regard the region of 
LHV correlators, C, as a sub-region in both V and V' as one 
can always go between the two spaces by disregarding p(0|s) 
and applying the normalisation inequalities. In the rest of the 
paper, including the discussion on facet Bell inequalities, we 
always discuss correlators in the space V. 

Given the motivation for finding non-trivial Bell inequal- 
ities we now present a simple way of generating non-trivial 
Bell inequalities. We begin by considering the CHSH inequal- 
ity [9 1 as an example. The CHSH inequality can be written as 



5*p(k\s u s 2 )<3, 



(19) 



Si ,S2 fe=0 



which is ([18) with 7£ = 3 and u k<s = <J* Sa . The CHSH 
inequality is also a facet Bell inequality for this setting. 

By convexity, it is only necessary to consider the ver- 
tices of the LHV polytope to obtain the bound 7^ = 3. In 
this case, the vertices correspond to the linear boolean func- 
tions of Si and s 2 [7|. For these vertices the correlators 
are then p(k\sx,S2) = S2 ) where g(s±,S2) are the lin- 
ear Boolean functions on {si, S2}. The sum in ( fT9| ) is then 



the function S1S2 for at most 3 values of s. An example of 
a linear function achieving this overlap is g(si, S2) = 0. On 
the other hand, if a correlator p(k = /(s)|si, S2) achieves the 
map /(s) = S1S2 deterministically, then it achieves a value 
of 773 = 4 for the sum in ( [19) , This inequality highlights the 
inability for LHV correlators to evaluate non-linear Boolean 
functions deterministically. 

For all settings (n, c, d), we can generalize the above con- 
cepts from the Boolean algebra for (2, 2, 2) to our more gen- 
eral framework of maps / : Z" — > We construct non- 
trivial Bell inequalities in the following way: 



d-i 



k=0 



< sup V 5 



m 

9(s) ' 



(20) 



where the supremum is over the set of rt-partite linear func- 
tions and /(s) is a non-n-partite function of s. Therefore there 
are N = d c " — d n ( c ~ 1 ' +1 non-trivial Bell inequalities of the 
form (|20j. 

We can prove the inequality in ( pO) using the same ar- 
guments as with the CHSH inequality. By convexity we 
only need to consider the vertices of C which correspond to 
the n-partite linear functions g(s). The sum in < f20] > is then 



/W U 9(s)' 



We then take the supremum over all n- 



partite functions g(s) to obtain the right-hand-side of |20) 

Whilst these inequalities might seem contrived, they in- 
clude some interesting Bell inequalities. For example, the 
Svetlichny inequality 1 38 1 for three parties takes the form of 
pOl), i.e., 



E 



s 1 (s 2 +s 3 )+s 2 s 3 



p(k\si,S2,s 3 ) < 6, (21) 



which is (f2"0|> for /(s) = si(s2+S3)+S2S3. The Svetlichny in- 
equality is not a facet Bell inequality for the LHV polytope C 
in the (3, 2, 2) setting [ 1 1 but it still captures very interesting 
phenomena. In particular, the Svetlichny inequality captures 
correlations that are consistent with LHV theories if one aver- 
ages over the measurement settings and outcomes for any one 
of the parties. 

The non-trivial Bell inequalities ( |2"0| i can be seen to truly 
capture the inability of LHV theories to evaluate non-n-partite 
linear functions via correlators. A way of emphasizing this is 
to consider these inequalities as a "non-local game" 0. That 
is, given an input s to n parties that do not communicate, what 
is the maximum probability of the parties producing outcomes 
m such that X)j=i m j = /( s ) f° r some function /(s) using 
different strategies (or resources such as shared randomness or 
entanglement). A referee chooses the input and then obtains 
all outputs m before computing m r 

If the referee's choice of input is uniformly random, then 
the inequality in ( pO) captures the mean success probability of 
an LHV strategy performing the function /(s); one just needs 
to divide both sides by c" to obtain a proper probability. If one 
imagines that the choice of input s is not chosen at random but 
instead with some probability p(s) > such that J2sP( s ) = 1 
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then we can modify ( |20] > to be: 

d-l 

E p(s) E 5 /( S) p( fc i s ) < su p E p( s )$o ' < 22 > 

s k=0 9( s ) s 

to get the mean success probability of performing the function 
/(s) with LHV correlators given a distribution p(s) on inputs. 
We now present the following proposition that shows an 
infinite class of non-trivial Bell inequalities can be easily 
generated. 

Proposition 6. Any Bell inequality of the form ( |22| > is non- 
trivial whenever p(s) ^ for alls. 

Proof. For any n-partite linear function, 

Ep( s )^) < Ep( s ) = 1 (23) 

s s 

f (s) 

as there must be a value of s such that 5 , ( — 0. Therefore the 
right-hand-side of ( |22| i is strictly less than 1 for all correlators 
in C. However, for the correlator p(k — /(s)|s) = 1 in V, the 
left-hand side of |22} is exactly 1 . □ 

The beauty of these non-trivial Bell inequalities is that they 
are easily generated as one can easily describe the n-partite 
linear functions for any setting (n, c, d), yet they still bound 
the region of LHV correlators. They also explicitly capture 
the computational aspect of a Bell inequality experiment: the 
value of the left hand side of |22} gives the mean success prob- 
ability of performing a particular function. 

We now consider the facet Bell inequalities that completely 
characterize the LHV polytope for small values of (n, c, d). 
Whilst these latter inequalities do not have the immediacy of 
the non-trivial Bell inequalities and are difficult to calculate, 
they are still the optimal tool for demonstrating non-classical 
correlations. 



B. Facet Bell inequalities 

The facet Bell inequalities satisfy the facet-defining condition 
for LHV correlators and so if a correlator is outside the region 
of LHV correlators it must necessarily violate one of the facet 
Bell inequalities. In Tab. [I]we listed the number of facet Bell 
inequalities for a few settings that could be computed using 
the software Polymake. Included in the number of facet Bell 
inequalities are the c" normalization and (d — l)c" positivity 
inequalities that define the polytope V. Despite these dc n in- 
equalities, there are still a significant number of inequalities 
remaining. Also note we are again discussing correlators in 
the space V of all correlators except p(0|s) and C is the sub- 
region of LHV correlators in this space. 

We can reduce the number of facet Bell inequalities by con- 
sidering relabelings of the measurement choices, outcomes 
and parties, which we refer to as symmetries. If a Bell in- 
equality can be changed into another inequality by such an 
operation, we refer to them as equivalent, or as members of 
the same symmetry class. Explicitly, the symmetries are 



n 


c 


d 


# Symmetry classes 


2 


2 


2 


1 


2 


2 


3 


1 


2 


2 


5 


4 


3 


2 


2 


4 


3 


2 


3 


62 


2 


3 


2 


1 



TABLE II: The number of symmetry classes for each setting 
(n, c, d). We have excluded the symmetry class of all positivity and 
normalisation inequalities from this number. 



1. permutations of parties - {s,-, Sj, s n } —> 
{si> , Sji , s n /} where k' = cr(k) is an element 
of the permutation group of order n 

2. relabeling of measurement settings - Sj —> Sj + ctj for 
some a.j E Z c 

3. relabeling of measurement outcomes - rrij — > rnj + 

P( s jJ) where P(SjJ) G 

The n-partite linear functions are closed under all of these op- 
erations. Using the facet-defining condition, the vertices of 
C that saturate a facet Bell inequality must be equivalent to 
another set of vertices in C by these operations that saturate 
another facet Bell inequality. Hence we can group together 
the facet Bell inequalities into these symmetry classes. 

There are nl permutations of n parties and c" ways of rela- 
beling measurement settings. Since for each input Sj we add 
a value /3(sj), for each input s, (3(s) — Y^j=i fi( s j) i s added 
to Yjj=i m j- There will be at most d cn values of /3(s). 

We constructed a search algorithm to find all the symme- 
try classes of facet Bell inequalities for each setting (n, c, d). 
In Tab. [II] we list the number of symmetry classes for each 
of the settings in Tab. [I] which is dramatically fewer than the 
total number of facet Bell inequalities. One of the symmetry 
classes for each setting is the class of normalization and pos- 
itivity conditions, leaving all the facet Bell inequalities that 
can be violated by LHV correlators outside of C We have 
therefore omitted the class of these positivity and normaliza- 
tion inequalities from Tab. |TTJin order to leave the inequalities 
that can be violated. 

We now look at a few particular families of facet Bell in- 
equalities for the bipartite and tripartite setting based on the 
CGLMP inequality. We briefly discuss the (2, 3, 2) setting 
and show how the CHSH inequality is essentially the only rel- 
evant facet inequality for this setting. 

Bipartite Two-Setting Facet Bell Inequalities - A symmetry 
class of note for n = c = 2 is the class of facet Bell inequal- 
ities that are equivalent to the CGLMP inequality lfl4l . We 
can rewrite this inequality after symmetry transformations and 
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mapping it into our normalized probability space V to obtain, 

2 

Ccglmp = d X p(E m 3 = 1 1 0, 0) — 
2 

£(-ir+ s =KE m '- = 1 i s )+ 

s j = l 

d-1 2 

E(- 1 ) S1+S2 E( d - k l )p (E m ^ = fc ' s ) ^ d - 

s fc=2 j = l 

(24) 

This inequality is also equivalent to the CHSH inequality for 
d = 2 which we now write in the space V for clarity: 

2 

c d =2 = Et-^xE m ^ = ^ - 2 - (25) 

s j=l 

For all possible correlators in V, the maximal value of the 
left-hand-side of the CGLMP inequality is 2c? — 1, thus vi- 
olating it. In fact, for all d, this maximal violation of the 
CGLMP is obtained by a vertex of V corresponding to the 
map f(s) = S1S2 + 1, i.e. the correlator p(k\s) — 6* g +1 . 
Masanes has shown that the only non-trivial, facet Bell in- 
equalities are those that are equivalent to the CGLMP for the 
(2, 2, 3) setting ED); this is confirmed by Tab. |n] 

For the (2, 2, 2) setting there are 2 4 — 2 3 = 8 non-n-partite 
linear functions and also 8 inequalities in the symmetry class 
of the CHSH inequality. This is no coincidence as every Bell 
inequality in the symmetry class in maximally violated by a 
vertex of V corresponding to a non-n-partite linear function. 
This also occurs for the (2, 2, 3) setting, there are 3 4 — 3 3 = 54 
non-n-partite linear functions and 54 inequalities in the sym- 
metry class of the CGLMP inequality. It can also be checked 
that every inequality in this symmetry class is violated by a 
different non-n-partite linear function. 

The correspondence between non-n-partite linear functions 
and facet Bell inequality echoes the non-trivial Bell inequal- 
ities described by the inequalities of the form ( p0| ). There is 
a computational aspect to each inequality as a violation indi- 
cates that a correlator is in some sense closer to the non-n- 
partite linear functions. However, the correspondence breaks 
down for the (2, 2, 5) setting where there are 5 4 — 5 3 = 500 
non-n-partite linear functions but 1000 non-trivial, facet Bell 
inequalities in total. In this instance, each vertex of V max- 
imally violates two facet Bell inequalities (each belonging to 
different symmetry classes). Therefore there are 1000 facet 
Bell inequalities (see Appendix[A|for more details). 

Tripartite Facet Bell Inequalities - We have given an indi- 
cation that facet Bell inequalities for n = c = 2 have a com- 
putational interpretation. Every facet Bell inequality we have 
found is maximally violated uniquely by a vertex of V. In 
this sense the violation of a facet Bell inequalities can quan- 
tify how computationally powerful a theory is. For situations 
with n > 2, this becomes more complicated even for n = 3 
and c = d = 2. The Mermin inequality [41] for example is in 



the symmetry class of the inequality 

3 

E^(-irxE m ^ = 1 i s )^ 2 - w 

s j=l 

Two vertices of V that are not vertices of C maximally violate 
this inequality. The two maps that correspond to these vertices 
are /(s) = S1S2S3 + 1 and f(s) = S1S3 + 1. Therefore 
there is no longer a one-to-one correspondence between facet 
inequalities and the vertices of V that violate the inequality. 

The CHSH inequality expressed in terms of the expectation 
value of measurements (see next subsection) for two parties 
can be used to build facet Bell inequalities for more parties 
ifTUl such as the above inequality ( |2"6| i. Analogously, we define 
CGLMP for three parties using the two party inequality. We 
have three parties but now we only consider non-zero terms in 
a Bell inequality when the third party's measurement setting 
S3 = 0. ForLHV correlators p(k\si, S2, 0) the n-partite linear 
functions that can be achieved are /(s) = [a\Si -Y0L2S2 + 03]3 
with 01,02,03 € Id'- the n-partite linear functions on two 
variables Sj and s 2 . Since the CGLMP inequality is facet- 
defining for the region of LHV correlators for two parties, or 
variables s\ and S2, it is facet-defining for this space of the 
n = 3 correlators for S3 = 0. Then we can write the tripartite 
CGLMP inequality as 

3 

C'cGLMP = dx P(E m l = 1 ' ' ' )~ 

i=i 

3 

^(-l) s ^KE m i = 1 l s i' s 2>0)+ 

s j = l 

d-l 3 

j2(-iy^ J2( d - k - !)p(E m i = fc l Sl ' S2 > °) ^ d - 

s k=2 j=l 

(27) 

For the case of (3, 2, 3), this tripartite CGLMP inequality is 
facet-defining and forms a symmetry class with 324 inequali- 
ties. This is also true for d = 2 where C' CGLMP results from 
the CHSH inequality and forms one of 4 non-trivial symme- 
try classes. Also in this case there is another way of substi- 
tuting the CHSH inequality to obtain the class containing the 
inequality |26|. This class also generalizes to the d = 3 set- 
ting so that the following inequality is against the space where 
Si = S2 in a similar fashion to C CGLMP : 

3 

Ccglmp = dy.p{^ mj = 1|0,0,0)- 
3 

E^(-ir i+S3 KE TO i = 1 i s )+ 

S 3=1 
d-l 3 

E*s(-i)* i+,i E( rf - k - !ME^ = k & ^ d - 

s fc=2 j=l 

(28) 
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This inequality also forms a symmetry class with 324 inequal- 
ities. There are 60 other symmetry classes for (3,2,3) (ex- 
cluding the class of positivity and normalization inequalities). 
Inequalities from each of these classes can be found in the 
supplementary material |42|. 

Bipartite Three-Setting, Two-Outcome Facet Inequalities - 
Finally, as can be seen from Tab. |n] f or the (2,3,2) setting 
there is only one symmetry class of non-trivial facet Bell in- 
equalities. The Bell inequality generating this symmetry class 
is a generalization of the CHSH inequality: 

2 2 

c c=3 = 5>ir^ + ^xe™, = lis) < 2. 

S 3=1 j=l 

(29) 

This is exactly the same as the CHSH inequality if instead of 
c = 3 we had c = 2. In fact for either c or d equal to 4 and 
n = 2, inequalities constructed from the CHSH inequality 
capture a lot of the structure of the LHV polytope. 

In this subsection we have given an indication of the rich- 
ness of the structure of the LHV polytope for some simple 
settings. There is also a computational element to some of 
these Bell inequalities, such as the CGLMP inequality which 
is maximally violated by a vertex of V, in that greater viola- 
tions enable correlators to come closer to evaluating a non-n- 
partite linear function deterministically. Quantum correlators 
are known to provide violations of Bell inequalities and so in 
the next section we discuss quantum correlators. 



C. Quantum correlators 

Quantum correlations can violate all manner of Bell inequali- 
ties if the correlations are generated from measurements on an 
entangled state. However, the connection between a violation 
of a facet Bell inequality and entanglement is not completely 
clear. Recently the two concepts have become very distinct 
and nowhere is this best demonstrated by the CGLMP in- 
equality where the greatest violation of the CGLMP inequal- 
ity is not achieved with a bipartite maximally entangled state. 
The region Q C V of quantum correlators is convex, but it 
is not a polytope since there are an infinite number extreme 
points lfl2ll . It is not clear in general how to define all of the 
extreme points of Q. We now discuss this region in the context 
of our CGLMP-like Bell tests. 

Without loss of generality, we can assume that the measure- 
ments at each site are projective measurements by Naimark's 
theorem ll43l . so that for a quantum state p € (H)® n , the 
correlators are 

n 

m j=l 

As p can be expressed as a convex combination of pure states, 
we can, for the purposes of finding the maximum quantum 
violation, assume that p is a pure state. 

A more compact way of expressing Bell inequalities is in 
terms of the expectation values of joint measurements M Sj 



lfT5ll35l . which, for the above projective measurements, are 

n 

E(s) = tr(p0M Sj ) 

n 

= EeT(E^^) tr (p(g) \ mj ) Sj { mj \ Sj ) 

m j=l 
d-l 

= ^e^ps^ls). (31) 

k=0 

The correlators pq(A;|s) in ( |3"Tj i can be replaced with LHV 
correlators to obtain the expectation value of measurements 
for LHV theories. 

Our construction of correlators therefore has a natural role 
in the construction of expectation values E(s). If we have a 
Bell inequality in terms of correlators such as ( fT8j ), it is possi- 
ble to relate it to a Bell inequality in terms of these expectation 
values by taking the discrete Fourier transform 11351 . The dis- 
crete Fourier transform of ( fTS) gives 

d-l 

EE^, s [E(s)]"< 7 £ (32) 

s fj,=a 

with 7£ as defined in ( fT8] l if the complex pre-factors 77^ are 

d-2 

iw = jE w *' e (33) 
fe=i 

This construction is another motivation for considering corre- 
lators as opposed to the full distribution p(m|s). 

To obtain the maximal violation of a Bell inequality by 
quantum correlators, we optimize ([32]) over pure states p and 
unitary operators M Sj (which correspond to projective mea- 
surements). For the case of (n, 2, 2), all facet Bell inequal- 
ities are maximally violated by the GHZ state \GHZ) = 
4=(|0)® n + \l)® n ) QO). In general, finding the maximal vi- 
olation is a difficult problem, however, there are methods of 
providing numerical lower and upper bounds on this quantum 
violation (see e.g. l45l and |[T5ll ). We will now discuss and 
utilize these methods to find the maximal quantum violations 
of facet Bell inequalities for two parties. 

First we discuss methods of finding a lower bound on a 
two-party maximal quantum violation used in |[T5l and ll46l . 
The quantum state is first fixed as the d 2 -dimensional max- 
imally entangled state \9) — ^5^j=o an< ^ we °Pti~ 
mize over the unitaries M 8j . More specifically we write 
the projectors as \rrij) Sj (rrij\ Sj — M Sj \k)(k\MJ., where 

{\k)\k G 1 d } is the standard basis of U D . The M Sj 
can be written as M s . = FD Sj where F is the d-by- 
d Hadamard, or Quantum Fourier Transform matrix and 
D s . = diag^ 1 ^'), e'^M, e^M), a diagonal matrix 
with <fij(sj) as real phases. Therefore we optimise over these 
phases <f>j(sj) to numerically maximize the quantum violation 
for the maximally entangled state. 

Once we find the optimal phases (sj ) for each Sj, we op- 
timize over the pure states, p, by finding the largest eigenvalue 
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n 


c 


d 


Symmetry class 


LHV bound 


Quantum bound 


2 


2 


2 


Cd=2 


2 


2.4142 1 " 


2 


2 


3 




3 


3.9149 


2 


2 


5 


Xi 


5 


6.3145 


2 


2 


5 




5 


7.6290 


2 


2 


5 


x 3 


5 


7.0314 


2 


2 


5 


CcGiMP 


5 


7.0314 


2 


3 


2 


C c =3 


2 


2.4142 f 



The non-trivial Bell inequalities that we have constructed in 
this section have an explicit computational meaning. We will 
explore these inequalities in the context of non-local games 
and Measurement-based Quantum Computing. More gener- 
ally we consider how much of the structures developed in 
this paper can be applied to these information processing sce- 
narios. Whilst a lot of ideas have been generalized from the 
(n, 2, 2) setting, we show that LHV correlators are more pow- 
erful when c,d > 2. 



TABLE HD We list the bipartite maximal quantum violations for par- 
ticular facet Bell inequalities for c and d. The inequalities T\, Ti and 
Is can be found in the appendix|A| Those violations that are achieved 
with the bipartite maximally entangled state of d 2 dimension are la- 
beled with a t. 



of the operators corresponding to ( |32| i. The largest eigenvalue 
then corresponds to the eigenvector Using this method 
it has been shown that the maximal violation of the CGLMP 
inequality is achieved with a state that is not maximally en- 
tangled Il46l . 

One method of providing an upper bound to the violation of 
a Bell inequality is the use of semi-definite programming |45 1. 
A Gram matrix is constructed from the expectation value of 
products (or sequences) of projectors and this matrix is pos- 
itive semi-definite. The Bell inequality is then a linear ob- 
jective function on this matrix subject to linear constraints on 
elements of the Gram matrix. Navascues et al constructed a 
hierarchy of semi-definite programs that converge to the set of 
quantum correlations and also constructed a method for test- 
ing for convergence B31 . 

We used YALMIP and SeDuMi [47 1 to implement these 
semi-definite algorithms to find the upper bound to quantum 
violations whilst finding lower bounds using previously de- 
scribed methods. In Tab. Ill we list the maximal quantum 



violations of the facet Bell inequalities in symmetry classes 
described previously; a single value is listed as the upper and 
lower bounds coincide differ by at most 10~ 9 , which is consis- 
tent with numerical error. Interestingly, of the settings studied, 
only Bell inequalities for qubits are maximally violated by a 
maximally entangled bipartite state. 



D. Summary 

Dual to a vertex description of a convex polytope is the facet 
description where facets are defined by linear inequalities. We 
have given an insight into some of the complicated structure 
behind these facet Bell inequalities. Despite the complicated 
structure, there is a computational insight into the facet Bell 
inequalities for particular settings. The CGLMP inequalities 
are computational in nature and can be used to construct facet 
Bell inequalities for multipartite scenarios. We then showed 
that these and other bipartite facet Bell inequalities are vio- 
lated by quantum correlators, which indicates that quantum 
mechanics offers a computational advantage relative to any 
LHV theory. 



IV. BELL TESTS, NON-LOCAL GAMES AND QUANTUM 
COMPUTING 



Connections have been made between Bell tests and 
Measurement-based Quantum Computing (MBQC) (6] 0. 
This connection has been explicitly explored in the setting 
where each party has a choice of two measurements, each with 
two outcomes. Furthermore, the role of post-selection in Bell 
tests simulating MBQC has been explored in this setting [7] 
leading to novel quantum phenomena. In this section we will 
give an overview of all of these connections and then discuss 
their implementation in general (n, c, d) settings. So much of 
the computational insight in constructing LHV correlators in 
the (n, 2, 2) setting carries over into the (n, c, d) setting, but 
the structure of LHV correlators is also richer. This richness 
also means certain results do not generalize. 

We have already discussed non-local games in the context 



of the non-trivial Bell inequalities in section III There is an 
overlap between non-local games and MBQC if we consider 
the elements in both information processing scenarios. Firstly 
in MBQC there are a number of sites that share a particular re- 
source (e.g. the cluster state ll48l for quantum computing), and 
single-site measurements are made on this resource. All mea- 
surement data is processed by a classical computer and in or- 
der to achieve a universal quantum computer in current mod- 
els, adaptive measurements are required; adaptivity means 
choices of measurements are informed by previous measure- 
ment outcomes. In the model developed by Raussendorf and 
Briegel, the classical computer only needs to be able to per- 
form linear Boolean functions to achieve a universal quantum 
computer |48 1. 

We now recall the model of non-local games as discussed 
earlier [5 1. There are a number of parties who share some re- 
source state but do not communicate with each other. These 
parties receive an input from a referee and send an output 
back. The referee processes the outputs to see if the par- 
ties successfully performed some task. MBQC can be recast 
as a non-local game where the parties choose their measure- 
ment based upon the input they receive and send the referee 
the measurement outcome. The referee in MBQC is a clas- 
sical computer who processes the measurement data in order 
to achieve some task. Therefore the referee performs linear 
Boolean function computation on the measurement data de- 
scribed by bits [8|. 

MBQC has been generalized to include more than a choice 
of two measurements and two measurement outcomes at each 
site J49 1, thus measurement data is no longer encoded in bits. 
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The classical computer processing measurement data in these 
models just uses addition modulo d where d is the number 
of measurement outcomes at each site l49l . Addition modulo 
d on data is still an extremely limited form of computation, 
even compared to the n-partite linear functions discussed in 
this paper. 

For simplicity we say that the data sent and received from 
the measurements sites is encoded as digits in Z^. The spe- 
cific model of MBQC as a non-local game that we consider 
consists of three stages: 

Ixl 

1. A digit-string x € Zj, 1 of length |x| is processed by a 
classical computer, which then sends a single digit to 
each site after processing; 

2. At each site, this digit is used to select a measurement 
and the outcome is sent to the classical computer; 

3. The classical computer processes this measurement 
data to output a single digit. 

The processing power of the classical computer is limited to 
addition modulo d. 

The input at each site Sj = h(x) is now a result of some 
pre-processing leading to a function h(x) on the digit-string x 
where |x| < n. This function is limited to addition modulo 

d on x, i.e. h(x) — Yj*=i a j x j where &j € Z^. Then the 
processing on measurement outcomes rrij leads to the output 
function Y^=i m j without loss of generality. 

The non-trivial Bell inequalities described in this paper cap- 
ture the mean success probability of LHV correlators evaluat- 
ing a non-n-partite linear function /(s) for the input s. We 
now investigate how the construction of Bell inequalities is 
modified by this new element of pre-processing on a digit- 
string x. The pre-processing allows us to express the correla- 
tors as being conditioned upon x, i.e., p(k\x) :— p(fc|s(x)). 

The correlators p(k\x) are now elements of a vector p in 
now (} escr ibe the structure of the region of all 
possible correlators. 

For the digit-string x if we set n = \x\ and Sj = xj for all 
j, then the possible correlators p(k\x) live in the space V for 
this setting. If we increase n and consider other forms of lin- 
ear pre-processing, we allow more freedom to evaluate more 
complicated functions. As shown in [8 1 for the pre-processing 
described above, quantum correlators for d = 2 = c ben- 
efit from this freedom. That is, for d = 2 = c settings, 
all deterministic correlators p(k\x) corresponding to functions 
f(x) : z|*' — > can be achieved for some sufficiently large 
n. 

Remarkably, LHV correlators p(k\x) ford — 2 are un- 
affected by increasing n using the pre-processing described 
above [7j. The deterministic correlators, or vertices of V 
for p(k\x) that LHV correlators achieve are always the lin- 
ear Boolean functions. In this way, the computational power 
of LHV correlators is not boosted by increasing n and pre- 
processing for d = 2. We now show that for general d > 2 
this no longer holds and computational power can be boosted. 
We actually show something stronger: if any form of the pre- 
processing described above with Sj — h(x) and |x| — n is al- 



lowed, then the computational power of LHV correlators will 
be boosted. 

Proposition 7. For arbitrary addition modulo d pre- 
processing on x leading to measurement settings Sj = h(x), 
the space of LHV correlators p(k\x) is not confined to the con- 
vex hull of n-partite linear functions on x for d > 2. 

Proof. As before, we consider the deterministic maps p(k\s) 
and then take their convex hull. However, we now consider 
the effect of pre-processing on x and make the assumption 
that d > 2. When each party receives the input Sj generated 
by the pre-processing, the input is Sj = h(x) where h(x) = 

Sj=i a j x j with ctj € Zrf. We now map into the space of 
all correlators p(k\x) under the influence of all possible pre- 
processing of the form h(x). 

For LHV correlators, we need to consider all single site 
maps %d. — > Zd which can be written as a polynomial over 
Z d as: 

(d-i) (d-i) 

mj= £ £ ^- (Z+1) +7, (34) 

y=l z=0 

where e y<z = (3 y (-l) z+1 { d ~ Y )y z and 7 , f3 y e Z d . If we 

add in the pre-processing stage where Sj — Y?,j=i a j x j th en 
single site maps become polynomials in elements of Xj, i.e. 

( z+1 > — (Y^fli o-j x j) d ~ <z+1 ). Therefore for appropri- 
ately chosen fi y and ctj, there are now cross-terms between 
elements of Xj, e.g. £1X2 etc if d > 2. Because of this the 
deterministic single-site maps with pre-processing cannot be 
described as n-partite linear functions on x. □ 

If we just have sj — xj and have n = \x\ parties then 
the LHV polytope is just the convex hull of n-partite lin- 
ear functions on x. Pre-processing therefore can boost the 
computational power of LHV correlators to go beyond this 
computational description. However, not all polynomials of 
elements Xj over Z^ can be achieved by taking powers of 

the linear functions 5^=1 a j x j- F° r example, the function 

/(x) = Ylj=i x< j~ X cannot be achieved by this method. More 
generally, any function that contains a term that is a non-n- 
partite linear function of degree greater than d — 1 cannot be 
achieved in any LHV theory for any value of n. So whilst 
the LHV correlators are boosted by this pre-processing, the 
resulting LHV polytope does not encompass the convex hull 
of all functions /(x). 

Knowing that the boosted LHV polytope cannot achieve a 
function such as /(x) = nj/=i x j~ 1 > we can construct a non- 
trivial Bell inequality for this function for all n > |x|. Instead 
of finding the upper bound for LHV correlators by taking the 
supremum over all n-partite linear functions, we now have to 
take the supremum over all possible linear pre-processing for 
all n maps of the form (34) . This added complication high- 
lights the uniqueness and simplicity of the case for d = 2, 
where one only needs to take the supremum over all linear 
Boolean functions of x even with pre-processing. 
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In the case where the number of possible inputs at each 
site is different from the number of possible outputs, there are 
other complications. We illustrate this with an example in the 
(3, 2, 3) setting but with a bit-string x € Z|. We have three 
parties with pre-processing on x leading to inputs: si = x\, 
S2 = X2 and S3 = [x\ + x 2 ] 2 - With this pre-processing one 
can achieve the non-n-partite linear function /(x) = X\X 2 
with the n-partite linear function /(s) = 2s 1 + 2s2 + S3 and 
this pre-processing as 2x\ + 2x 2 + [x\ + x 2 ] 2 = S^ lX2 , There- 
fore even with c = 2 and d > 2, the LHV polytope is also no 
longer confined to n-partite linear functions on x. 

The effect of processing on LHV correlators in the general 
framework of Bell tests is pointedly different from the two- 
measurements, two-outcome case. We hope to illustrate that 
there is a richness in structure in how information processing 
affects classical correlations and an appreciation of the con- 
nections between computation and correlators to give a handle 
on the effect of processing. In Q, the effect of processing can 
be seen to have a role in creating so-called loopholes in Bell 
tests and it is worthwhile exploring loopholes in the frame- 
work of data processing described here. 

We also hope these results really emphasize why the 
(n, 2, 2) setting is so special and such a powerful platform 
for distinguishing between classical and non-classical corre- 
lators. Even with linear Boolean pre-processing, the classical 
computation possible with LHV correlators is well character- 
ized. This is still a computationally interesting setting as two 
inputs and two outputs at each site already enables quantum 
computing and the richness of structure that comes with that. 
Despite the fact that MBQC can be generalized to the setting 
with more inputs and outputs at each site, the same LHV struc- 
ture does not retain the same character in this setting. 

V. SUMMARY AND OPEN QUESTIONS 

In this paper, we have explored the relatively under-studied 
multi-setting, multi-outcome Bell inequality test in the con- 
text of the computational perspective of Bell tests, which has 
allowed us to derive new families of non-trivial Bell inequal- 
ities. We have presented our work in terms of correlators and 
not the full measurement statistics of an experiment for clarity 
and convenience. However, we have shown that a significant 
amount of the structure of the space of non-signaling proba- 
bility distributions are captured by this setting. 

We have shown that LHV theories can be generally con- 
sidered in terms of limited computational expressiveness, i.e. 
LHV theories can only evaluate n-partite linear functions as 
defined. A violation of a Bell inequality defined by these 



functions implies a computational advantage relative to any 
LHV theory. Given this interpretation, we can construct an 
infinite class of non-trivial Bell inequalities. Furthermore we 
have explicitly constructed facet Bell inequalities that define 
the region of LHV correlators using the construction of LHV 
correlators in this paper. 

Central to this paper is the incorporation of a computational 
perspective using modulo arithmetic into the more established 
picture of the convex polytopes of LHV theories. This per- 
spective provides a broad arena to explore and, while we have 
given some insight, many open questions still remain. Firstly, 
what is the general correspondence between non-n-partite lin- 
ear functions and facet Bell inequalities? There seems to be 
a one-to-one or a one-to-two correspondence in certain set- 
tings but this is not always the case. There are many happy 
coincidences that need further clarification. 

A huge open question is what exactly defines the quantum 
region in our space of correlators? For the (n, 2, 2) correla- 
tor Bell test, we know that measurements on GHZ states form 
the extreme points of the quantum region ifTUl fTTI . However, 
little is known for other settings. At the moment finding the 
eigenvectors for maximal violations of a Bell inequality has 
not resulted in a general expression for the states that max- 
imally violate the CGLMP inequality, if such a closed form 
expression even exists. In addition, more study is needed to 
determine the effect of pre-processing on quantum correlators. 

In ||7), signaling correlators were simulated with post- 
selection on measurement data. We have shown that the gen- 
eralization of this post-selection leads to a boost in the power 
of LHV correlators; something that is akin to a loophole in a 
Bell test. However, in the more general setting, does simu- 
lating signaling correlators also boost LHV correlators? Are 
there more nuanced ways of processing data that does not ex- 
pand the region of LHV correlators as occurs in Proposition 

If a violation of a Bell inequality is a resource for informa- 
tion processing [2-4, 6 8|, then all aspects of Bell tests need 
to be explored to determine how we can best exploit this re- 
source. We have tried to hint at how the richness of structure 
of abstract local hidden variable theories can be captured by 
a simple computational picture. We hope that this work gives 
more insight into the cooperative relationship between quan- 
tum information and the foundations of quantum mechanics. 
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Appendix A: Facet Bell inequalities 

In this section we list the facet Bell inequalities which be- 
long to symmetry classes that are not explicitly described in 
the paper. We express the correlators p(k\s) as the elements 
of a (d — l)c n vector in real space IR( d-1 ) c ". To simplify 
we describe the inequalities in terms of a vector b that has 
inner product with correlator column vectors p to form the 
Bell inequality, i.e. b ■ p < jc- The coefficients corre- 
spond to elements going from left-to-right as p(l|0, 0, 0), 
p(2|0,0,...,0) up to p{d- l\c - l,c- l,...,c- 1). We will 
use this notation in the next two sections of the appendix. For 
the (2, 2, 5) setting, there are 4 symmetry classes of facet Bell 
inequalities (excluding the class of positivity and normaliza- 
tion inequalities). One of these is the CGLMP inequality and 
the other three are given by the vectors of coefficients 

h = \ (6, 2, 3, 4, 4, -2, 2, 1, 4, -2, 2, 1, -4, 2, -2, -1) , 

b 2 = (3,1,-1,-3,2,-1,-4,-2,2,-1,-4,-2,-2,1,4,2). 

b 3 = (2,-1,1,-2,3,1,-1,2,3,1,-1,2,-3,-1,1,-2). 

(Al) 

Denoting by Z, the inequality corresponding to bj , T\ and the 
CGLMP inequality are maximally violated by the vertex cor- 
responding to f(s) = S1S2 + 1 and I 2 and Z3 are maximally 
violated by the vertex corresponding to /(s) = 2sis 2 + 1. 

The LHV bounds for all of the inequalities are listed in Tab. 
Ill with their quantum violation. 

As we have seen there is a corresponding function for each 
of these inequalities that leads to a maximal violation. Each 
of these functions is a vertex in the general space of Bell cor- 
relators mentioned in the paper. 



Appendix B: Generalization to non-prime number of settings 

In this paper, we focused on the case where c and d are prime. 
However, this assumption is not needed to prove all the re- 
sults. We now give a quick overview of how we describe LHV 
correlators in terms of polynomials on inputs for non-prime 
c. For non-prime number of settings, the number of inputs 
at each site c can be just expressed as the Cartesian product 
of the prime factors of c, i.e. Z c — > Z Cl x Z C2 x ... x Z Cq 
where {ci, c 2 , c q } is the set of prime factors of c and q is 
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TABLE IV: The facet Bell inequality expressions that each belong to 
a particular symmetry class for (2, 4, 2). Each row corresponds to a 
particular inequality belonging to a different symmetry class. Each 
column of b is an element of this vector that forms an inner product 
with p. The LHV upper bound for inequalities B\, B2, B3, B4, and 
B 5 is 8 and 4 for B 6 , B 7 , B 8 , B 9 , Bio, and B u . 



the number of prime factors in c. As a result each input Sj 
can be represented as a string of elements Sj — {tj, t|, i'} 
where t) E Z Cfc . 

For LHV theories we consider the single site maps rrij and 
then take their sum modulo d. For non-prime dimension then, 
each deterministic single site map can be written as 



ij = aij(\) + 



E 



(Bl) 



where <x, (A) € and /3J(A) <E Id dependent on the local 
hidden variable A and = {0, 0, 0)}. We can use the re- 
definition of the delta function over Z c — > Z Cl x Z C2 x ... x Z Cq 
used for each prime dimension Z Ck . This results in the map: 



m a )+ e I 1 



M(ck-i) 



ve2 

1^0 



k=l 



(B2) 

One then takes the sum modulo d of all of these single site 
maps to obtain (a superficially complicated) map which is n- 
partite linear as it only consists of single-site maps. There is 
also no multiplication between values of i* with other values 

of tj, for j 7^ j'. Again there are d' i ( c_1 )+ 1 possible deter- 
ministic maps for LHV theories, and their convex hull forms 
the LHV poly tope C. 

The smallest value of non-prime d is 4 and so we studied the 
structure of the LHV polytope for the (n, 4, 2) setting which 
has 2 7 n-partite linear functions corresponding to the vertices 
of the LHV polytope. There are 27, 968 facet Bell inequalities 
for this setting found with Polymake [39 1 which reduces to 15 
symmetry classes (14 excluding the class of normalization and 



positivity inequalities). Three of these classes are forms of the 
CHSH inequality embedded in the larger number of inputs. 
For completeness, we have listed all 14 Bell inequalities in 



Tab. IV We now explicitly write out one of these inequalities: 



2 2 

cu = E(- i ) sis2 iw + <^me^ = !i s ) ^ 2 - 

8 3 = 1 j = l 

(B3) 

which is almost exactly the same as C^, =3 . The other two in- 
equalities, C^L 3 and Cg =3 are similar to this inequality except 
with altered delta functions for C^ =3 : 

2 

IJ (tf + 6? )^(5 s ^+5^) (6 S * +5?), (B4) 



and forCjL 3 : 



II (# + S i )^(So 1 +S S 2 1 ) (^ 2 + 5? )■ (B5) 

Throughout the whole of the paper, we have assumed that d 
has been prime. Indeed, this fact has been vital in the proof of 
Theorem |2j so this proof fails for non-prime d. However, for 
other aspects of the methods in this paper, a non-prime d has 
little consequence for our construction of correlators. There 
are two approaches to considering non-prime d: the most ob- 
vious approach is just to think of representing outcomes as a 
string of digits from prime number registers as with the de- 
scription of settings above; secondly, instead of having the 
elements f(s) of a function as elements of a vector in a vec- 
tor space over Zd for prime d, we now have elements of a 
module over 7Ld\ a generalization of a vector space. Since 
we only ever take addition of functions represented in a Kro- 
necker delta function basis to generate all functions, all of our 
methods generalize to these modules. 

From the aspect of functions, the only difference for non- 
prime d is the fact that f(y) in |2| can now take values from 
a non-prime register Zd- The derivation of all functions and 
ri-partite linear functions generalizes naturally. For (2,2,4) 
with the smallest non-prime d = 4, we present the facet Bell 
inequalities. There are four symmetry classes for this setting 
of which one is the class of positivity and normalization in- 
equalities. One of these three non-trivial symmetry classes 
is formed by the CGLMP inequality. The second is another 
generalization of the CHSH inequality and is of the form: 



(-iy 



p(^2 m j = !i s ) 



p(E m 3 



= 3|s) 



< 2, 



(B6) 

which is essentially the CHSH inequality if each party groups 
their outcomes rrij into modulo 2 terms. In other words, since 
1 mod 2 is equal to 3 mod 2, the above inequality is equivalent 
to the CHSH inequality if each party just maps from mod 4 
arithmetic to mod 2. 
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The third and final symmetry class is generated by the fol- 
lowing inequality (expressed in the notation described ear- 
lier): 

(l,2,l,l,2,l,l,2,l,-l,-2,-l)-p<4. (B7) 
The deterministic map or function /(s) that achieves the max- 



imal upper bound 6 of this inequality for all correlators in 
V is /(s) = 2sis 2 + 2. It is worth noting that this can be 
constructed by adding J2 S 2(— l) Si;S2 p(2|s) to the previous in- 
equality. 



